DNA DENATURATION AS A NEW KIND OF PHASE TRANSITION 
Abstract

Unbinding of a double-stranded DNA reduces to an unscreened long range interaction
and maps onto various problems. Heterogeneity renormalizes the interaction.
Renormalization is temperature dependent. At an unbinding transition it approaches
critical dimensionality. This implies giant non-universal critical indexes and
invalidity of the Gibbs distribution sufficiently close to the critical temperature $T_c$.
Fluctuations are macroscopically large below $T_c$. There are no fluctuations above it.

PACS Numbers: 87.14.Gg; 64.60.Fr; 05.70.Gg

Thermal unbinding (melting, coiling, denaturation) of a double-stranded DNA
molecule is biologically important and physically unique. It yields a phase transition
in a one-dimensional system [1]. The system is extraordinarily long: the total length of a
single mammalian DNA is 1.8 m, and it consists of ~5 billion nucleotide base pairs. Their
sequence is related to genetic information, yet statistically it is close to a random one.
The fraction of unbound base pairs as a function of temperature ("the DNA melting
curve") is proportional to DNA light absorption at about 260 nm. DNA denaturation
maps onto a variety of other problems: the binding transition of a polymer onto
another polymer, a membrane, or an interface [3]; wetting in two dimensions [4];
depinning of a flux line from a columnar defect in type-II superconductors [5];
localization of a copolymer at a two-fluid interface [6]. DNA denaturation has been
extensively studied for nearly four decades. Yet, some features of this transition
were overlooked. Start with its physics and model. DNA nucleotide base pairs
(adenine-thymine AT, guanine-cytosine GC) are large ("mesoscopic") organic
molecules. Their unbinding releases a few thousand degrees of freedom. The
corresponding entropy is $s k_B$ per site ($k_B$ is the Boltzmann constant, $s \sim 10$). So,
while the binding (hydrogen) energy of DNA strands is ~3000 K, DNA melts at a
relatively low room temperature (~300 K), i.e. in the vicinity of the ground state.
The Poland-Scheraga model [1] of DNA melting introduces the fusible AT and
refractory GC binding energies $E_1 = -s k_B T_1$ and $E_2 = -s k_B T_2$ correspondingly
($T_1 < T_2$), the boundary energy $J$ per bound segment ($J \sim 3000$ K accounts for an
incomplete unbinding at the boundaries), and the loop entropy $-c k_B \ln L$ per unbound
segment ($L$ is the total number of nucleotide pairs there). The value of the constant $c$
may vary from 1.5 to slightly higher than 2. Thus, at the temperature $T$, the
Poland-Scheraga Hamiltonian $E_{\ell L x}$ of the adjacent bound and melted segments is
related to the length $\ell$ and the GC concentration $x$ in the former and to the length $L$
in the latter. Calculated from the energy $-s k_B T$ per site of a completely melted DNA
($T$ is the temperature),

$$E_{\ell L x} = s k_B \ell\,\delta T + J + c k_B T \ln L - s k_B \ell\,(x - \bar x)\,\Delta T, \tag{1}$$
$$\delta T = T - \bar T, \quad \bar T = T_1 \bar x + T_2 (1 - \bar x), \quad \Delta T = T_2 - T_1,$$

where $\bar x$ is the AT concentration at an entire DNA. Parameters $\bar T \sim 310$ K, $\Delta T \sim 40$ K
depend on the DNA solution. Start with the well known case of a homopolymer.
There $x = \bar x$, the last term in Eq. (1) is missing, and $E_{\ell L x} = E(\ell, L)$ depends on
$\ell$ and $L$ only. Then the entire Hamiltonian $H = \sum_n E(\ell^{(n)}, L^{(n)})$ describes an ideal
gas of bound and unbound segment pairs $(\ell^{(n)}, L^{(n)})$. It relates the free energy $f$ per
site to the normalization condition for the Gibbs probability $p_{\ell L}$ of given $\ell$ and $L$:

$$p_{\ell L} = \exp\{-(\ell + L)\varphi - E(\ell, L)/k_B T\}; \quad \sum_{\ell, L = 1}^{\infty} p_{\ell L} = 1, \quad \varphi = -f/k_B T \tag{2}$$
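
As a numerical illustration of Eq. (2), the sketch below solves the normalization condition for $\varphi$ in the homopolymer case by bisection. It is a minimal sketch under stated assumptions: the sums over $\ell$ and $L$ are taken as discrete sums, the loop weight is $L^{-c}$, and $\tau = s\,\delta T/T$ is treated as a given input. The helper names, the truncation `L_MAX`, and the default values `c=2.0` and `j_over_kt=10.0` are illustrative choices, not taken from the text.

```python
import math

L_MAX = 5_000  # truncation of the loop-length sum (illustrative assumption)

def loop_sum(phi, c):
    """Truncated sum over melted lengths: sum_L exp(-phi*L) * L**(-c)."""
    return sum(math.exp(-phi * L) * L ** (-c) for L in range(1, L_MAX + 1))

def normalization(phi, tau, c, j_over_kt):
    """Sum of p_{l,L} over l, L >= 1 for the homopolymer form of Eq. (1).

    The sum over the bound length l is geometric:
    sum_l exp(-l*(phi + tau)) = 1 / (exp(phi + tau) - 1).
    """
    bound_sum = 1.0 / math.expm1(phi + tau)
    return math.exp(-j_over_kt) * bound_sum * loop_sum(phi, c)

def solve_phi(tau, c=2.0, j_over_kt=10.0, tol=1e-12):
    """Bisect normalization(phi) = 1; return 0.0 when no positive root exists."""
    lo = max(0.0, -tau) + 1e-12      # phi + tau must stay positive
    if normalization(lo, tau, c, j_over_kt) < 1.0:
        return 0.0                   # completely melted chain: phi = 0
    hi = lo + 50.0                   # the normalization sum is far below 1 here
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        # the sum decreases monotonically with phi, so keep the bracketing half
        if normalization(mid, tau, c, j_over_kt) > 1.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

Calling `solve_phi(tau)` for decreasing `tau` (lower temperature) returns a growing $\varphi$, while for `tau` above a small positive threshold the function returns zero, which corresponds to the completely melted chain.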

When $\varphi \ll \exp(-J/k_B T)$, Eqs. (1,2) yield

$$\int_1^{\infty} \exp(-L\varphi)\, L^{-c}\, dL = (\varphi + \tau)\exp(J/k_B T); \quad \tau = s\,\delta T/T; \quad c_1 = c - 1 \tag{3a}$$

Consistent with the Landau-Peierls theorem for the Hamiltonian (1), when $c > 1$
(i.e. $c_1 > 0$), Eq. (3a) yields a phase transition. Then, by Eq. (2), $\varphi = 0$ does not
allow for any excitations of a completely melted polymer. This is specific for the
Hamiltonian which depends on $\ln L$ only: when $L = \infty$, any excitation would imply an
infinite energy increase. Dependence on $\ln L$ yields other unusual implications also.
The transition is non-universal: its critical indexes depend on $c_1$. Immediately below
the critical temperature $T_c$ [1,3,7,8],

$$\varphi \sim \theta \ \ \text{if}\ c_1 > 1; \quad \varphi \sim -\theta/\ln\theta \ \ \text{if}\ c_1 = 1; \quad \varphi \sim \theta^{1/c_1} \ \ \text{if}\ 1 > c_1 > 0 \tag{3}$$

$$\theta = (T_c - T)/(T_c - \bar T); \quad \tau_c = (T_c - \bar T)/\bar T = (1/s c_1)\exp(-J/k_B T).$$
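
The relative offset of the critical temperature in Eq. (3) is plain arithmetic and can be evaluated directly. The sketch below does so for round illustrative values ($s = 10$, $c_1 = 1$, $J/k_B T = 10$, $\bar T = 310$ K); the function name and the exact numbers are assumptions made only for this example.

```python
import math

def critical_offset(s, c1, j_over_kt):
    """(T_c - T_bar)/T_bar = exp(-J/k_B T) / (s * c1), the last relation of Eq. (3)."""
    return math.exp(-j_over_kt) / (s * c1)

T_BAR = 310.0  # K, illustrative ground-state melting temperature

relative_width = critical_offset(s=10.0, c1=1.0, j_over_kt=10.0)
absolute_width = relative_width * T_BAR  # T_c - T_bar in kelvin

print(f"(T_c - T_bar)/T_bar = {relative_width:.2e}")
print(f"T_c - T_bar = {absolute_width:.2e} K")
```

With these inputs the relative offset is of order $10^{-5}$ to $10^{-6}$ and the absolute offset is of order $10^{-3}$ K.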

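As anticipated, the critical $c_1 = 0$, while $J/k_B\bar T \sim \bar T/\Delta T \sim s \sim 10$ implies, by
Eq. (3), a very narrow width of the transition $\sim (T_c - \bar T)/\bar T \sim 10^{-5}$ (i.e.
$T_c - \bar T \sim 10^{-3}$ K), and its very close proximity to the ground state melting
temperature $\bar T$. Once the free energy (3) is known, the Gibbs probability (2) allows one
to calculate any thermodynamic averages and fluctuations. The average (denoted by a
bar) relative number $\omega = \ell/L$ of the melted sites, which is measured via light
absorption, the average length $\bar L$ of a melted segment, and their relative mean squared
fluctuations $\Delta\omega/\bar\omega$, $\Delta L/\bar L$ are:

$$\bar\omega \sim c_1 \exp(J/k_B T) \gg 1; \quad \Delta\omega/\bar\omega \sim 1$$

$$\bar L \sim 1 \ \ \text{if}\ c_1 > 1; \quad \bar L \sim \varphi^{c_1 - 1} \ \ \text{if}\ c_1 < 1 \tag{4a}$$

$$\Delta L/\bar L \sim 1 \ \ \text{if}\ c_1 > 2; \quad \Delta L/\bar L \sim \varphi^{0.5 c_1 - 1} \ \ \text{if}\ 2 > c_1 > 1; \quad \Delta L/\bar L \sim \varphi^{-0.5 c_1} \ \ \text{if}\ 1 > c_1 > 0$$

Thus, $\Delta\omega/\bar\omega$, $\Delta L/\bar L$ are never small, while $\Delta L/\bar L \to \infty$ when $T \to T_c$ and
$c_1 < 2$. A more physically meaningful fluctuation is

$$\Delta^*\omega/\bar\omega = \overline{|\omega - \bar\omega|}/\bar\omega \ll 1; \quad \Delta^* L/\bar L = \overline{|L - \bar L|}/\bar L \sim 1 \tag{4b}$$

It demonstrates, in particular, that a characteristic $|L - \bar L| \sim \bar L$ implies a
characteristic $|\ln L - \overline{\ln L}| \sim 1$, i.e., $\ll \ln L$, when $c_1 < 1$ and $L \to \infty$.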
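
The scaling of the mean loop length in Eq. (4a) can be checked numerically. The sketch below assumes the loop-length distribution $P(L) \propto L^{-c}\exp(-\varphi L)$ on integer $L \ge 1$, which is the $L$-dependence of Eq. (2) for the homopolymer, and estimates the exponent of $\bar L$ versus $\varphi$ from two small values of $\varphi$. The function names, the two probe values and the cutoff are illustrative assumptions.

```python
import math

def mean_loop_length(phi, c, l_max):
    """Mean L under P(L) ~ L**(-c) * exp(-phi*L), L = 1..l_max (truncated sums)."""
    norm = 0.0
    first_moment = 0.0
    for L in range(1, l_max + 1):
        weight = math.exp(-phi * L) * L ** (-c)
        norm += weight
        first_moment += L * weight
    return first_moment / norm

def estimated_exponent(c, phi_small=1e-4, phi_large=1e-3):
    """Slope of log(mean L) versus log(phi) between two small values of phi."""
    cutoff = 40.0  # terms beyond L = cutoff/phi carry a factor below exp(-40)
    l_small = mean_loop_length(phi_small, c, int(cutoff / phi_small))
    l_large = mean_loop_length(phi_large, c, int(cutoff / phi_large))
    return math.log(l_small / l_large) / math.log(phi_small / phi_large)

slope_divergent = estimated_exponent(1.5)  # c_1 = 0.5 < 1
slope_finite = estimated_exponent(2.5)     # c_1 = 1.5 > 1
print(slope_divergent, slope_finite)
```

For $c = 1.5$ the estimated slope is close to $c_1 - 1 = -0.5$, and for $c = 2.5$ it is close to zero, in line with the two cases of Eq. (4a). The agreement is approximate because $\varphi$ is small but finite.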

Consider heterogeneous DNA. When temperature increases from $\bar T$ to
$\bar T + \delta T$, the Poland-Scheraga Hamiltonian (1) complements the energy increase of an
"average" bound segment (the first three terms) with the energy decrease of a
refractory bound segment (the last term). I prove that in the vicinity of the DNA
melting temperature, the last term may be replaced with its thermodynamic average 
for given lengths of the successive bound and unbound segments. (Such replacement 
is equivalent to an unusual mean field approximation, which becomes accurate at the 
phase transition and which technically reduces to a constrained summation in the 
partition function). The resulting Hamiltonian describes a homopolymer with the 
renormalized loop entropy. The renormalization, and thus the phase transition 
singularity it determines, are non-universal and depend on the DNA parameters. 

Physics of DNA melting was elucidated in ref. 8. (All following statements
are later accurately verified). The segments, rich in the fusible AT, melt first, while
the richest in the refractory GC melt last. When $c > 1$ in Eq. (1), bound segments
completely vanish at a finite critical temperature $T_c$, where $L \to \infty$ and the effective
boundary energy $(J + c k_B T \ln L) \to \infty$. Then the excitation energy also $\to \infty$, DNA
approaches its ground state, and fluctuations of $\ell$ and $x$ vanish. When $T \to T_c$, the
length of a ground state (i.e. sufficiently refractory) bound segment is $\propto \ln L \to \infty$,
to compensate the effective boundary energy in Eq. (1). Sufficiently close to $T_c$, $\ell$
exceeds any finite correlation length, and the probability $w(\ell, x)$ of a given $x$ at such
$\ell$ is Gaussian. Since fluctuations of $\ell$ and $x$ vanish at $T_c$, it is $\sim$ the thermodynamic
probability $\ell/L$ of a bound site. So, $\ell$ and $x$ yield $L(\ell, x) \sim \ell/w(\ell, x)$. Thus, the
values of $\ell$ and $L$ at a given temperature determine the corresponding value of $x$
according to $w(\ell, x) \sim \ell/L$. The Gaussian $w$ implies
$\sqrt{\ell}\,(x - \bar x) \propto \sqrt{\ln[1/w(\ell, x)]} \gg 1$, where the factor in $w \sim \ell/L$ may be
disregarded with negligible error. Thus, $x - \bar x$ in Eq. (1) may be replaced with its
thermodynamic average according to

$$\ell/L = (\ell/2\pi D^2)^{1/2} \exp(-u^2); \quad u^2 = \ell\,(x - \bar x)^2/2D^2; \quad D^2 = \bar x(1 - \bar x) \tag{5a}$$

In fact, large $J/k_B\bar T \sim \bar T/\Delta T \sim s \sim 10$ allow for Eq. (5a) already slightly above $\bar T$.
Equation (1), complemented with the unusual mean field approximation (5a) for $x$,
yields the renormalized Hamiltonian $E^*(\ell, L)$, which depends on the variables $\ell$ and $L$
only. In the leading (in $\ell/L \ll 1$) approximation it equals

$$E^*(\ell, L) = s \ell k_B\,\delta T + J + c k_B T \ln L - s k_B D\,\Delta T \sqrt{2 \ell \ln L} \tag{5}$$

The last refractory term accounts for the thermodynamic average of $x$ for given values
of $\ell$, $L$, $\bar x$ and $\Delta T$ in the Poland-Scheraga model for a heteropolymer. The
Hamiltonian describes a "renormalized" homopolymer, and Eq. (2), where $E(\ell, L)$ is
replaced with $E^*(\ell, L)$, yields its exact free energy. The competition in Eq. (5) of the
energy increase and decrease, correspondingly in the "average" first and last
"refractory" terms, yields a deep and relatively narrow $E^*(\ell, L)$ minimum at the ground
state $\ell = \ell_m = 0.5\,(D\,\Delta T/\delta T)^2 \ln L$ (which is indeed $\propto \ln L$ as stated earlier). The
expansion of $E^*(\ell, L)$ in $\ell - \ell_m$ non-universally decreases the factor $c$ in the loop
entropy by $s (D\,\Delta T)^2/2T\delta T$, and Eq. (2), with $E$ replaced with the expanded $E^*$, after
a straightforward calculation, yields

$$\int_1^{\infty} (\ln L)^{1/2}\, L^{-1-\delta} \exp(-\varphi L)\, dL = M, \tag{6}$$

where

$$\delta = c_1 - \gamma; \quad \gamma = s (D\,\Delta T)^2/2T\delta T; \quad M = \pi^{-1/2} (2 c_1)^{-3/2} (s D\,\Delta T/T)^2 \exp(J/k_B T) \gg 1. \tag{7}$$
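
The ground state length and the shift of the loop exponent can be verified directly from Eq. (5). The sketch below evaluates $E^*(\ell, L)/k_B$ in kelvin, checks that $\ell_m$ is a minimum, and confirms that $E^*(\ell_m, L) = J + (c - \gamma)\,k_B T \ln L$. The parameter values ($T = 316$ K, $\delta T = 6$ K, $s = 10$, $c = 2$, $D = 0.5$, $\Delta T = 40$ K, $J/k_B = 3000$ K, $L = 10^6$) and the function names are illustrative assumptions.

```python
import math

def renormalized_energy(l, L, dT, T, s, c, D, DT, J):
    """E*(l, L)/k_B of Eq. (5) in kelvin; J stands for J/k_B and dT for T - T_bar."""
    log_L = math.log(L)
    return s * l * dT + J + c * T * log_L - s * D * DT * math.sqrt(2.0 * l * log_L)

def ground_state_length(L, dT, D, DT):
    """l_m = 0.5 * (D*DT/dT)**2 * ln L, the minimum of E* over l."""
    return 0.5 * (D * DT / dT) ** 2 * math.log(L)

def gamma_shift(dT, T, s, D, DT):
    """gamma = s * (D*DT)**2 / (2*T*dT), the reduction of c in the loop entropy."""
    return s * (D * DT) ** 2 / (2.0 * T * dT)

# Illustrative values only (assumptions for this example).
p = dict(dT=6.0, T=316.0, s=10.0, c=2.0, D=0.5, DT=40.0, J=3000.0)
L = 1.0e6

l_m = ground_state_length(L, p["dT"], p["D"], p["DT"])
e_min = renormalized_energy(l_m, L, **p)
gam = gamma_shift(p["dT"], p["T"], p["s"], p["D"], p["DT"])
e_expected = p["J"] + (p["c"] - gam) * p["T"] * math.log(L)

print(f"l_m = {l_m:.2f}, gamma = {gam:.4f}, delta = {p['c'] - 1.0 - gam:+.4f}")
print(f"E*(l_m) = {e_min:.3f} K, J + (c - gamma) T ln L = {e_expected:.3f} K")
```

The two energies agree to rounding error, which is the statement that the minimum over $\ell$ replaces $c$ by $c - \gamma$, i.e. $c_1$ by $\delta = c_1 - \gamma$.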



Note that the left hand side of Eq. (6) depends on $\varphi$ and $\delta$ only. Thus, Eq. (6) reduces
five dimensionless parameters ($J/T$, $\Delta T/T$, $\bar T/T$, $\bar x$, $c$), which determine $\varphi$ in a
non-renormalized case, to two parameters ($\delta$ and $M$). When $c_1 < 1$ and
$\varphi \ll \tau_c - \tau$, Eq. (6) maps onto Eq. (3a), where $c_1$ is renormalized into the
temperature dependent $\delta$ (which, unlike $c_1$, may be any sign and which dominates the
temperature dependence in the vicinity of the phase transition).

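By Eq. (6), $\varphi \ge 0$. Since $c_1 > 0$ [1,7,10-12], $\varphi = 0$ is achieved (as stated) at
finite temperature $T = T_c$. There

$$\delta(T_c) = \delta_c = c_1\theta^*, \quad \theta^* = (2\pi^2)^{1/3} (\bar T/s D\,\Delta T)^{4/3} \exp(-2J/3k_B\bar T)$$

$$\delta T_c/\bar T \sim (s/2c_1)(D\,\Delta T/\bar T)^2, \quad \delta T_c = T_c - \bar T \tag{8}$$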
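
The quantities in Eq. (8) can be evaluated for representative parameters. The sketch below assumes that the numerical prefactor of $\theta^*$ reads $(2\pi^2)^{1/3}$, and uses the round values $\bar T = 310$ K, $\Delta T = 40$ K, $s = 10$, $D = 1/2$, $c_1 = 1$, $J/k_B\bar T = 10$; the function names are hypothetical and the output is an order-of-magnitude illustration only.

```python
import math

def theta_star(t_bar, s, d, delta_t, j_over_kt):
    """theta* of Eq. (8), with the prefactor read as (2*pi**2)**(1/3) (assumption)."""
    prefactor = (2.0 * math.pi ** 2) ** (1.0 / 3.0)
    return prefactor * (t_bar / (s * d * delta_t)) ** (4.0 / 3.0) * math.exp(-2.0 * j_over_kt / 3.0)

def critical_shift(t_bar, s, c1, d, delta_t):
    """delta T_c = T_c - T_bar from delta T_c / T_bar ~ (s / 2 c1) (D dT / T_bar)**2."""
    return t_bar * (s / (2.0 * c1)) * (d * delta_t / t_bar) ** 2

th = theta_star(t_bar=310.0, s=10.0, d=0.5, delta_t=40.0, j_over_kt=10.0)
shift = critical_shift(t_bar=310.0, s=10.0, c1=1.0, d=0.5, delta_t=40.0)
order = 1.0 / (1.0 * th)  # transition order 1/(c1 * theta*) with c1 = 1

print(f"theta* = {th:.4f}, delta T_c = {shift:.2f} K, 1/(c1 theta*) = {order:.0f}")
```

With these inputs $\theta^*$ comes out below $0.01$, $\delta T_c$ is a few kelvin, and the order $1/c_1\theta^*$ exceeds 100.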

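Note that, by Eq. (8), $\delta T_c \sim 3$ K. When $T_c - T \ll \delta T_c$, then $\delta = c_1(\theta^* - \theta)$,
where $\theta = (T_c - T)/\delta T_c$ is the relative distance to the critical temperature. By
Eq. (4b), $\delta \ll 1$ implies $\Delta \ln L \ll \ln L$, and thus verifies the derivation of Eq. (5).
At $T_c$, by Eq. (8), $\delta_c \sim 0.01$, i.e. it is very close to the critical $c_1 = 0$ (cf. Eq. (3)).
By Eq. (6), $L \propto 1/\varphi \to \infty$ when $T \to T_c$. This, and $\ell \sim \ell_m \propto \ln L$, verify all
previous estimates. When $\delta, \varphi \ll 1$, asymptotics in Eq. (6), where $M \gg 1$, yield an
unusual non-universal singularity:

$$\varphi \sim \left[(3\theta/4\pi^{1/2}\theta^*) \ln(\theta^*/\theta)\right]^{1/(c_1\theta^*)} \quad \text{when}\ \theta \ll \theta^* \tag{9a}$$

$$\varphi \sim \left[(2\theta^*/\pi\theta) \sqrt{\ln(\theta/\theta^*)}\right]^{3/(2c_1\theta)} \quad \text{when}\ 1 \gg \theta \gg \theta^* \tag{9b}$$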

Consider the implications of Eqs. (7-9b). In natural DNA $J/k_B\bar T \sim \bar T/\Delta T \sim 10$,
$D \sim 1/2$, $c_1 \sim 1$. So, in the immediate vicinity of $T_c$, where $\theta < \theta^* \sim 0.01$, the
order of the transition, by Eq. (9a), is $1/c_1\theta^* \sim 100$, i.e. giant. The order is
non-universal: it depends on the DNA parameters $T_1, T_2, \bar x$. The values of $T_1, T_2$
depend on the ligands and their concentrations in the DNA solutions [7,8], which may
be manipulated experimentally. Non-universality in Eqs. (9a, 9b) is related to the
competition of the refractory and loop entropy terms in Eq. (5), which renormalizes
the loop entropy, and thus the singularity. The width of the transition (9a) is very
small, yet macroscopic. The crossover from Eq. (9b) to Eq. (9a) occurs when
$(T_c - T)/T_c \sim 10^{-4}$, i.e. $T_c - T \sim 0.01$ K (cf. $\delta T_c = T_c - \bar T \sim 3$ K). In the
approximation of Eq. (6), the probability density $P_L$ of a given $L$ is
$P_L = M^{-1} (\ln L)^{1/2} L^{-1-\delta} \exp(-\varphi L)$. So, by Eqs. (6, 9b),
$L \propto 1/\varphi \propto \exp[1/(T_c - T)]$ exponentially increases to $L \sim 10^{40}$ at the
crossover. Thus, even in a solution with $\sim 10^{22}$ DNA nucleotide base pairs, all DNA
molecules completely unbind in the interval (9b). So, at a small, yet macroscopic
distance ~0.01 K from $T_c$, the effective long range interaction exceeds any
macroscopic size of the system. The system can no longer be divided into weakly
interacting subsystems, thus the Gibbs distribution is invalid. The fraction of bound
sites is correspondingly small there, and the observable quantity is the temperature of
complete melting of a finite DNA. If its length is $N$, then $L = N$ at the temperature
$T_N$, when

$$\theta_N = (T_c - T_N)/\delta T_c \sim 1/\ln N. \tag{10}$$

The mean fluctuation $\Delta\theta_N$ of $\theta_N$ may be estimated from
$L(\theta_N + \Delta\theta_N) - L(\theta_N) = \Delta^* L(\theta_N)$. Similar to Eq. (4b), $\Delta^* L \sim L$, and thus [13]

$$\Delta\theta_N/\theta_N \sim 1/\ln N. \tag{11}$$

Such fluctuation is macroscopic and easily observable. In DNA this situation is
related to the mesoscopic size of base pairs (which yields large $J/k_B\bar T \sim s \sim 10$, thus
small $\theta^*$), and to DNA heterogeneity. By Eq. (7), heterogeneity effectively replaces
fixed $c_1$ with $\delta$. The latter decreases to $\delta_c \ll 1$ (at $T = T_c$) and scales the
transition order with $1/\delta_c$ in Eq. (9a) and with $1/\delta$ in Eq. (9b). This may be
characteristic of any sufficiently strong long range interaction.

By Eq. (10), natural DNA always yields $\theta \gg \theta^*$, i.e. the essential singularity
(9b) in $\theta \propto T_c - T$. (This was predicted in ref. 8). By Eq. (8), it proceeds in the
interval $T_c - T \sim 0.01\,T_c \sim 3$ K. Sufficiently close to $\bar T$, the length $L$ may reach the
correlation length of the sequence. Then the distribution $w(\ell, x)$ becomes
non-Gaussian. This alters Eq. (5) and the melting curve $\varphi(T)$.

Below $\bar T$ DNA is mostly bound, and only anomalously fusible segments
melt. Their probability yields the equation which replaces Eq. (5). Their melting
proceeds in an entire interval $\Delta T$. Until, at sufficiently high temperatures, the number
of segments which melt nearly simultaneously becomes large, the DNA melting curve
exhibits their successive melting. It is explicitly seen in experiments [1,8]. Thus, in a
general case there are three distinctly different temperature intervals: $\theta \sim 0.01$, i.e.
$T_c - T \sim 0.03$ K; $\theta \sim 1$, i.e. $T_c - T \sim 3$ K; and $\Delta T \sim 40$ K.

A giant order transition (9a) may be observed only when the total number $N$
of base pairs is much larger than $L$ at the crossover to Eq. (9b). This implies
$\ln N > 1/\theta^*$. Since, by Eq. (8), $\theta^* \sim 0.005\,D^{-4/3}$, $D$ must be $< 0.03\,(\ln N)^{3/4}$. On
the other hand, the derivation of Eq. (6) implied the large renormalized term. At the
crossover this means $D > 0.03$. In the interval $0.03 < D < 0.03\,(\ln N)^{3/4}$
non-universality of the giant critical index in Eq. (9a) may be studied (e.g., via its
dependence on $\Delta T$, which changes together with the concentration of solvents in
DNA solution [7]).
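
The window $0.03 < D < 0.03\,(\ln N)^{3/4}$ is easy to evaluate for a given chain length. The sketch below does so for $N = 5 \times 10^9$ base pairs (the mammalian value quoted earlier) and tests two illustrative compositions through $D^2 = \bar x(1 - \bar x)$ from Eq. (5a). The function names and the sample compositions are assumptions made for this example.

```python
import math

def d_window(n_pairs):
    """Bounds (lower, upper) of the interval 0.03 < D < 0.03 * (ln N)**(3/4)."""
    return 0.03, 0.03 * math.log(n_pairs) ** 0.75

def heterogeneity(x_bar):
    """D = sqrt(x_bar * (1 - x_bar)), from D**2 = x_bar * (1 - x_bar) in Eq. (5a)."""
    return math.sqrt(x_bar * (1.0 - x_bar))

def in_window(d, n_pairs):
    """True when D lies strictly inside the window for the given chain length."""
    lower, upper = d_window(n_pairs)
    return lower < d < upper

N_PAIRS = 5.0e9  # base pairs, the mammalian DNA length quoted in the text
lower, upper = d_window(N_PAIRS)
print(f"window: {lower:.2f} < D < {upper:.3f}")
for x_bar in (0.5, 0.05):  # illustrative compositions
    d = heterogeneity(x_bar)
    print(f"x_bar = {x_bar}: D = {d:.3f}, inside window: {in_window(d, N_PAIRS)}")
```

For this $N$ the upper bound is about $0.3$, so an equimolar composition ($D = 1/2$) falls outside the window, while a strongly biased composition falls inside it.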

The presented theory may be numerically tested. Once the ground state is
accurately determined analytically [8], computer simulations allow for the study of its
fluctuations.

The approach is applicable to other problems also.

To summarize, DNA unbinding with temperature proceeds from piecewise
melting of fusible domains, to an essential singularity, to a giant ($\sim 1/\theta^* > 100$) order
phase transition. The latter may be observed when the AT or GC concentration is
between 0.03 and $0.03\,(\ln N)^{3/4}$, where $N$ is the total number of nucleotide pairs. In the
vicinity of complete melting the Gibbs distribution is invalidated.